Usable speech measures and their fusion

نویسندگان

  • Robert E. Yantorno
  • Brett Y. Smolenski
چکیده

Usable speech is a novel concept related to the co-channel speech problem. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech is to identify and extract those portions of co-channel speech that are still useful for speech processing applications such as speaker identification or speech recognition, which do not work in cochannel environments. Usable speech measures are features that are extracted from the co-channel signal to detect the presence of usable as well as co-channel (unusable) speech. Several usable speech measures are currently being developed; however, these measures detect only about 75% of the usable speech. To improve on this performance, nonlinear estimation and Bayesian classification are used to fuse the information in two recently proposed usable speech measures. Using fusion resulted in a 15% increase in hits (usable speech frames detected) and a 37% decrease in false alarms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Local Linear Wavelet Neural Network and RLS for Usable Speech Classification

While operating in a co -channel environment, the accuracy of the speech processing technique degrades. When more than one person is talking at same time, then there occurs the co-channel speech. The objective of usable speech segmentation is identification and extraction of those portions of co-channel speech that are degraded in a negligible range but still needed for various speech processin...

متن کامل

A Persian Cued Speech Website Fromthe Deaf Professionals’ Views

Objectives: Increasingly people are using the internet to find information about medical and educational issues and one of the simplest ways to obtain information is internet. Persian Cued Speech is a very new system to Iranian families with deaf child and the professionals and a few educators have enough knowledge about it, so the purpose of this study was to introduce Persian Cued Speech webs...

متن کامل

Speech intelligibility after repair of cleft lip and palate

    Background: Intelligibility refers to understandability of speech; and lack of it can negatively affect children’s overall communication effectiveness. Children with repaired cleft lip and/or cleft palate (CL/P) may experience poor speech intelligibility. This study aimed at evaluating speech intelligibility in children with repaired CL/P who had not been referred to sp...

متن کامل

Structure-based Speech Classifcation Using Non-linear Embedding Techniques

Usable speech” is referred to as those portions of corrupted speech which can be used in determining a reasonable amount of distinguishing features of the speaker. It has previously been shown that the use of only voiced segments of speech improves the usable speech detection system, and also, that unvoiced speech does not contributes significant information about the speaker(s) for speaker ide...

متن کامل

Developing usable speech criteria for speaker identification technology

Recently, a “usable speech” extraction system [1] was proposed to separate co-channel speech into “usable” frames that are minimally corrupted by interfering speech. Studies indicate [2] that a significant amount of cochannel speech can be considered “usable” for speaker identification (SID). Therefore, it is necessary to establish criteria for usable speech frames for SID. Voiced speech, of wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003